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TELOMERASE 



TEMPLATE 

CCAAAACCCCAAAAC 
tTtT GGGG-3- 



PANEL A 




TELOMERASE 



TEMPLATE 

CCAAAACCCCAAAAC 
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1651 
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1751 



CCCCAAAACC 
GTAGTTTAGA 
TTTGGATGAT 
GCTCTTGTAG 
TCGCCTTCAT 
CTCGGATGCA 
AAAGCGGAGA 
AATCAGGTAA 
TTATGGAGAA 
CCTAGCCAAC 
GATCTCGATA 
AAAAAGTTTG 
TATGGGTTTT 
TTGAGACAAT 
AGTTCTGATG 
CTCAATTTAA 
TAATGGAATA 
TTGGATTCTT 
TTACAACAGA 
AAAAGAAGCA 
TTGTTG ATT C 
AAAGAAGAAG 
AGGTAAGAGA 
C AC AG CTGTT 
GTAAAAAGTA 
TCACATTCAT 
GCAGTCATCC 
AAGAAATGGA 
AAGAATCGAA 
ATTGAAGAGA 
GTAACTTTTA 
CGATCTTCAA 
ATACAAACCT 
AAAGAAAAAA 
AAGATTTATT 
GGGGTTTTGG 



CCAAAACCCC 
AATAAAATAT 
ATAGAAAATT 
TGACAAGAAA 
TGACTATTCC 
AAT CTTT AT A 
GCAAAGAGTA 
TGAGGATTAT 
AATTACTTAA 
AATGATGAGT 
CATCAGACTT 
ATAATCGAAC 
ATTACAATTG 
TGAAAAAGCT 
TGTATGCCAT 
TGGATAGCTA 
TACGTTAAAT 
AAAG CAT AG A 
TT AC CTGTTT 
GGCGAAATGA 
TTCTGTAACC 
AGCTATCACA 
GAT AC AT T C A 
ATTTT CTTTT 
TCAAATAAGA 
AGATCGACCT 
GTTTTAAAAA 
GCCGAAATCT 
CTCTAAATCT 
TTGACGAGGC 
TTAATTAGAG 
TTGACGAAAT 
TGGTCAAAAT 
TAAGGCAATA 
TTTTTCAATA 
GG 



AAAACCCCTA 
TATTCCCGCA 
TACTTCCTAA 
GGATGCAAAA 
AAAGTTGCAA 
ACGATTCTTT 
GAAATTGAAA 
TCTATTTTTT 
TACTAAAAGG 
ATATTAAATT 
ACCAAAGACA 
AGCAGAAGAA 
TTTTAGGTAT 
GTTTACAACT 
TATTTTGTGA 
TAGAAACAAA 
CCTTTGGGAC 
TACACAGAAT 
TGATT ACT CT 
AAAGAAGACT 
GG AATTAACA 
ATCCTGATTC 
TTAAAATTCA 
ATCTTAACAA 
GAAGCGCTAG 
T CAT AT AT C C 
TAGTGCTATG 
TAATCAAAAA 
TTCGTTAATA 
AACTGCACAG 
AATAAACTAA 
AAAAGCTGAA 
ATTGAGGAAG 
AATAAAATGA 
ATTTATTGAA 



TAAAAAAAGA 
CAAATGGAGA 
TACATTCAAC 
CATTGAAATC 
AAACAATTAG 
CTTGAGAAAA 
CATTACTAAT 
AGATCACTTC 
TAAACAGTTT 
CATATGAGAA 
AACTCGCTAT 
CTTATTGCAT 
CGACGGTGAA 
GAAGGAATCG 
ATTAATCTCA 
CCAAATAAAC 
AAATGCACAC 
GCTTTAGAGA 
TGCTCATCTC 
AAAGAAAGAG 
ACAAGAATAT 
TTAAAGATTT 
TATATTATAG 
TATTTTTTGA 
ACTGAGGTAA 
AATACGATGA 
AGG ACT AAAT 
GAATTGCGTC 
AGTATTACCA 
AAG AT CAT T A 
ATTACTAATA 
CTAAAGTTAG 
GAAAAGAAGA 
GTACAGAAGT 
AAGAGGGGTT 



AAAAATTGAG 
TGGATATTGA 
AAG T AT AG C A 
TGGCTCGAAA 
AGTTCTACTT 
TTAGTTTTAA 
GTTTAAATAA 
TTAAGGAGCA 
GGATTATTTC 
TGAGTCAAAG 
AAAACGCAAG 
TTACTATTCG 
CTCCCGAGTC 
CAGTTCTGAA 
AATATCTTAT 
CATGCAAGTT 
TGAATTTATA 
CTGATTTAGC 
TTATATCTTT 
ATTTCAAAAT 
TAGCAACGAA 
CAAAAATTCC 
TTTTTCATTT 
TTAGCTGGAA 
CTTAGCTTAT 
TAAGGAAACA 
TTTTAGAGTC 
GATATTGCAA 
ATCTTGATTG 
AAGAAATAAA 
TAG AG AT C AG 
ACAATAAAAA 
CCAGTTAGCA 
GAAGAAATAA 
TTGGGGTTTT 
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1 AAAACCCCAA 

51 AACCTCAGTA 

101 ATGGAGGTTG 

151 TCTTAAGACT 

201 TCCAGAAAGT 

2 51 TTAGAAGATA 

3 01 AGACTATAAT 

3 51 CAACTGGACT 

4 01 TCATCAAGCG 

4 51 ACTTAAGGGA 
501 CTCAAAAGCA 

5 51 ATTGGAAATG 
601 GCGAACTTCT 
651 AT C ATTTGAA 
701 GCAGACATGA 
751 GAATGAGAAA 
8 01 ATATGAAATC 
8 51 AACCAATTCT 
901 AGCGATGGAC 
951 GAATTAGAAA 

10 01 CTTGAGAAAG 

1051 TCCTCTTCCA 

1101 TAAATAAAAC 

1151 TACACAACTG 

1201 C AAT AT ACT C 

12 51 AAAAGAAAGT 

13 01 AAAAACTTAT 

13 51 GGTTGAGACC 

14 01 ACGTCTTATG 
14 51 CTGATTAGAT 
1501 AAC CT ATT AC 
1551 TCGCAGACTT 

16 01 GAAGAATGGA 
1651 AC CGAAGAAA 

17 01 T AAATT C AG A 
17 51 AACTCTCACT 
1801 TTTTGGATTC 
1851 AGTTTGTTTG 
1901 ACTATGGATA 
1951 AACATTCCTA 
2 001 CTGCACAAAT 
2 051 TTTAGAAAGA 
2101 TGCACTTGAA 
2151 AACAAAATGA 
2 2 01 AGAAATTATT 
2 2 51 C C AAT AT AAT 
2 301 GAATTCCTCA 
2 3 51 G C AAC ATT AG 



AACCCCAAAA 
TTAATAAGCT 
ATGTTGATAA 
TGTGAAGAAA 
TATTAGATGA 
TTAAAATATT 
GAAGAAGATT 
AATGAT CGAA 
ATGTTTCAGA 
AATCAATTAG 
GTATTTCTTT 
AGCTCTTCCG 
GAAGGAACTC 
AGT CAACGAT 
ATGAACCTCG 
GATCACTTTC 
AAGAACCAGA 
TCAAAAAGCA 
AGAGCTCAGA 
GAAGCTAAAA 
TCAAAGATTT 
GAAAATTGGC 
TAGAGAAGAA 
ATAATAAATG 
C C C AAAG ACT 
TAAGAAATAT 
TGCTTGAGAA 
TCTGCAAAGC 
GAAATTGCTC 
GATTTTTCTA 
TACAGAAAGA 
AAAGAAGGAA 
AAAAGT CGCT 
ACTACTTTCC 
CCGGAAGACT 
TAATGCTTAA 
GCTGTTTTTA 
CAAATGGAAG 
TCGAAAAGTG 
AAAACTACTA 
T CT AAAG AG A 
AAGAAATGAA 
GGAGGACAAT 
CTTAAATGCA 
TTAAGAAAGA 
TACATTAACT 
AGGTCTTTGA 
AGGAAAGCTC 



CCCCTTTTAG 

CAGATTTTAA 

TCAAGCTGAT 

TTAAAGAAGC 

AGAAAT CAAT 

TGCGCAGACA 

TTAAAGTTAT 

CTTATTGACA 

TAGACAAAAA 

CAAAGACCCA 

CAAGACGAAT 

ACATCTCTAC 

TTGTTCAATT 

AAGTTTGACA 

ATGTTGATCA 

TCAACAACAT 

ATATTTTATT 

TGAGTTTGTG 

CGATATT CAC 

GATAAGGTTA 

TAACTTCAAC 

GGGAACGGAA 

AAGTCGAAGT 

CGTCACACAA 

TTTTGACTGG 

GTGGAACTAA 

GAT CAAT AC A 

ATTTTTATTA 

CGATGGATAT 

TGTCACCGAG 

ATATTTGGGA 

ACGCTTGCTG 

TGGATTTGCA 

GTCCAATTAT 

ACAAAATTAA 

G AC AT TGAAG 

ACTATGATGA 

CAAGTTGGAC 

ATATGATAGT 

AATTACTTTC 

AAGAATAACA 

AGATTATTTT 

ATCCAACCTT 

AAGAAAACAT 

TAACTTACTT 

TTAATGGGAA 

GTTTCATCAA 

CTTAGGATTC 



AGCCCTGCAG 

ATATTAATTA 

AATCATGGCA 

TAAAACGTTG 

CT CAAAGT C A 

AATATTGTTG 

TGCAAGAAAA 

AATGCTTAGT 

CTTCAATGAT 

TTTATTAACA 

GGAACCAAGT 

ACTAAATATT 

TTGCGGGAAT 

AAAAGCAAAA 

ACCTGCAAAT 

CAACGTGCCG 

GCACTCATTT 

AGTAACAAAA 

GAATAT ATT C 

TCGAAAAAAT 

TACT AT TT AA 

ACAAAAAATC 

ACTATGAAGA 

TTTATTAATG 

AAGAAACCGT 

ACAAGCATGA 

AGAGAAATAT 

TTTTGATCAC 

TCGAGGATCT 

CAACAGAAAA 

CGTCATTATG 

AGGTCCAAGA 

CCTGGAAAAC 

GACTTT CAAT 

C T AC AAAT AC 

AATAGAATGT 

T GT AAT G AAA 

AACCAAAACT 

GTAAACAGAG 

TTCAGATTTC 

TAGTTATCGA 

AGACAGAAAT 

ATTCAGTGTT 

TAATTGTTGA 

C AAC C AGT C A 

GTTTTATAAA 

TTTTGTCATC 

CTTAGAGATG 



TTGGAAATAT 

CAAAACCTAA 

TTCACTCAGC 

TACTCTTGGA 

TTATAAAGAT 

CTACTCCACG 

GAAGTATTTT 

TGAACTT CTT 

TTGGATTTCA 

GCTCTTTCAA 

TAGAGCAATG 

T AAT ATT CCA 

AACGTTTTTG 

AGGTGGAGCA 

ACAATGTCAA 

AATTGGAATA 

TAATAGAAAT 

ACAATATTTC 

AGATTTAATA 

TGCCTACATG 

CAAAATCTTG 

GAAAACTTGA 

GCTGTTTAGC 

AATTTTTCTA 

AAGAATTTTC 

ACTCATTCAC 

CATGGATGCA 

GAAAACATCT 

CGTCGTCTCG 

GTTACTCCAA 

AAAATGT C AA 

AAAAGAGGTT 

TC AG ACT AAT 

AAGAAGATTG 

GAAGTTATTG 

TTAAAGATCC 

AAGTATGAGG 

CTTCTTTGCA 

AAAAACTATC 

TGGATTATGA 

TTCGAAAAAC 

TCCAGAAGAT 

CTTGAAAATG 

AG C AAAG C AA 

TTAATATTTG 

CAAACAAAAG 

ATTTTATTAT 

AAT CAATGAA 
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24 01 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 

24 51 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2 501 ATAAACGTAA GT CGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 

25 51 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 

26 51 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2 701 AGAAGGAATT CTGTGTACAC T CAAT CT AAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2 801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
2851 CAACAAGTTA TTT AT AT CAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 

2 901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
2951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 

3 001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
L-3 3 051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAAT AT T CAGCACAAAA 
I J! 3101 AAGTACATTT TCAACAGAGT TTGCATGATC CT CAAGGCAA AAG AAG C AAA 
£ 3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 

3 201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 

3 2 51 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 



m 



FIG. 9 

3 (CONTINUED) 

2 

□ 

j>y 1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 

V 51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 

101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 

151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 

201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR I FYCTHFNRN 

251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 

301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 

3 51 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 

401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 

451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMS I ADLKKE TLAEVQEKEV 

501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 

551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 

601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSRN 

651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 

701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 

751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 

801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 

851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 

901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 

951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 

1001 KY I FNRVCM I LKAKEAKLKS DQCQSLIQYD A 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

! + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

a ?QNPKTPKPL*KKKKLR*FR 

b PKTPKPQNPYKKRKN*GSLE- 

C PKPQN PKTPI KKEKI E V V * K - 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 + + . + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

a NKILFPHKWRWILIWMI*KI 

b IKYYSRTNGDGY*FG*YRKF- 

c * N I I PAQMEMDIDLDDIENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a YFLIHSTS IAALVVTRKDAK 

b TS*YIQQV*QLL* *QERMQN- 

c LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 --- + + + + ----- + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H*NLARNRLH* LFQSCKNN* 

b IE1WLEIAFIDYSKVAKTIR- 

C LKSGSKS PSLTI PKLQKQLE- 

AGTT CT ACTTCT CGG ATGC AAATCTTT AT AACG ATT CTTTCTTG AGAAAATT AGTTTT AA 

241 - + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

a SSTSRMQIFITILS*EN*F* 

b VLLLGCKSL*RFFLEKISFK- 

c FYFS DANLYNDS FLRKLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 + + + + + +■ 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT • 

a KAESKE*KLKHY*CLNKIR* 

b KRR'AKSRN*NITNV*IKSGN- 

c SGEQRVEIETLLMFK*NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 + + + + + 420 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a *GLFYFLDH FLRSIMEKIT* 

b EDYS I F * ITS # GALWRKLLN- 

c RIILFFRSLLKEHYGENYLI- 

F/G. 12 
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TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 + + + 4- + + 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*KVNSLDYFPSQQ**VY*I 

b TKR*TVWI ISLANNDEYIKF- 

c LKGKQFGLFP* PTMMS I L N S - 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 + + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRIS IHQTYQRQTRY 

b I * E *VKGSRYIRLTKDKLAI 

c YENESKDLDTSDLPKTNSL*- 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

541 + + + + + + 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 



a KTQEKV* *SNSRRTYCIYYS 
; Lf b KRKKKFDNRTAEELIAFTIR- 

¥ C NARKSLI IEQQKNLLHLLFV- 

^ TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 + + + + + + 660 

"* ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 



a YGFYYNCFRYRR*TPES*DN 

b MGFI'TIVLGIDGELPSLETI 

c WVLLQLF *VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 + + + + + + 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a * KSCLQLKESQF* K F * • C V C H 

b EKAVYN*RNRSSESSDVYAI 

c KKLFTTEGIAVLKVLMCMPL- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + + + + + + 780 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN*SQISYLNLMDSYRNK 

b IL*INLKYLISI*WIAIETN- 

C FCELISNILSQFNG* L * KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

731 + + + + + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKSFGTNAH 

b QINHASLMEYTLNPLGQMHT- 

c K*TMQV*WNIR* ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 + + + + + + 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a * IYIGFLKHRYTECFRD*FS. 

b EFILDS*SIDTQNALETDLA- 

c NLYWILKA* IHRML*RLI * L - 
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TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 + + + + t-- + 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLI SLKEA 

b YNRLPVLITLAHLLYL* KKQ- 

c TTDYLF*LLLLISYI FKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 + + + + + + 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEISKFVDSSVT 

b AK*KED*RKRFQNLLILL*P- 

C RNEKKTKERDFKIC* FFCNR- 

i«* GGAATT AAC AAC AAG AAT ATT AGC AAC G AAAAAG AAG AAG AGCT AT C AC AATC CTG ATTC 

Q 1021 + + + + + + 1080 

} *5 CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

! H a GINNKNISNEKEEELSQS*F 

,£ b ELTTRILATKKKKS YHNPDS 

l|% c N*QQEY*QRKRRRAITILIL- 

V~ TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

|<* 1081 + + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

O a LKISKIPGKRDTFI KIHIL* 

IP b * RFQKFQVREIHSLKF I YYS- 

|«j c KDFKNSR*ERYIH*NSYIIV- 

! .y TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

□ 1141 + + + + + + 1200 

j y AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS*QYFLISWK- 

C FHFTAVI FFYLNNI F * L A G S - 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSI K*EKR*TEVT* LIHIH 

b * KVSNKRSARLR * LS L FTFI 

C KKYQ IREALD*GNLAYSHS * - 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + + --+. 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIR** GNSSHPF*K.- 
b DRPSYIQYDDKETAVI RFKN- 

C I DLHI SNTMIRKQQS SVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 + + -- + + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a * C Y E D * IFRVKKWSRNLNQK 

b SAMRTKFLESRNGAE I L I K K - 
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MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYI ITKALLEVAESDPEF ICQLA 
VY I RNEL Y I RTTTNY I VAFCWHKNTQ P F I EKYFNKAVLL PNDL 
LEVCEFAQVLYI FDATEFKNLYLDRILSQD IRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMKAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKILGKKYPKTEEEYKAAFGDSASAPFNPELAGKRMKIEISKTW 
ENELS AKGNTAEVWDNL I S SNQL P YMAMLRNLSN I LKAGVS DTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR 
ENMNLKGQ I EAVKE WEKTDEE KKDMELEQTEEGEFVKWEG IG 
KQYINS IELAI KIAVNKNLDEIKGHTAI FSDVSGSMSTSMSGGA 
KKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL 
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFA 
VDLEGYGKCLNLGDEFNENNY I KI FGMS DS I LKF I S AKQGGANM 
VEVI KNFALQKIGQK 

FIG. 20 

MS RRNQKKPQAP IGNETNLDFVLQNLEVYKSQ I EH YKTQQQQ I K 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQ 
QVQLIKKVGSKVEKDLNLNEDENKKNGLSEQQVKEEQLRTITEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFEISHDQ 
KNYVS I YANQKTSYCWWLKDYFNKNNYDHLNVS INRLETEAEFY 
AFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERF 
NILNIRSSYTRNQYNFEKIGELLETI FAWFSHRHLQGIHLQVP 
CEAFQYLVNSSSQISVKDSQLQVYSFSTDLKLVDTNKVQDYFKF 
LQEFPRLTHVSQQAI PVSATNAVENLNVLLKKVKHANLNLVS I P 
TQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEI YDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNI IAS LL YPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVS INCILDQHILNS ISEF 
LEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPELNQVYINQ 
QLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFD 
QNTVSDDS IKKILES ISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

FIG. 22 



MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCI IYLLTGELYNNVLTFGYKI 
ARNEDVNNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
N YT V I Q FNGQ FFTQI VGNRCNE PHLPP KWVQRS S S S S AT AAQ I K 
QLTEPVTNKQFLHKXiNINSSSFFPYSKILPSSSS IKKLTDLREA 
IFPTNLVKI PQRLKVRINLTLQKIiLKRHKRLNYVS ILNS ICPPL 
EGTVLDLSHLSRQSPKERVLKFI IVILQKLLPQEMFGS KKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAICFISWLFRQLI PKI IQTFFYCTEISSTVTI VYFR 
HDTWNKL I TP F I VEYFKTYLVENNVCRNHNS YTLSNFNHS KMR I 
IPKKSNNEFRI IAIPCRGADEEEFTIYKENHKNAIQPTQKILEY 
LRNKRPTSFTKIYSPTQIADRIKEFKQRLLKKFNNVLPELYFMK 
FDVKSCYDS IPRMECMRILKDALKNENGFFVRSQYFFNTNTGVL 
KL FNVVN AS RVP KP YEL YI DNVRTVHLSNQD VINWEME I F KTA 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTLILKLADDFLI ISTDQQQVINI KKLAMGGFQKYNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
SKGIFRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHWKNISE 
CYKSAFKDLS INVTQNMQFHSFLQRI IEMTVSGCPITKCDPLIE 
YEVRFTI LNGFLESLSSNTSKFKDNI ILLRKEIQHLQAYI YI YI 

HIVN ^ 
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Euplotes 



LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 



FIG. 24 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAA AGCAGT CATTGATCACTGCAT CATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGT'tTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGAT AAT AT C ATT CTTTTG AGAAAGGAAATTCAACACTTG CAAG C 
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AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQS IGIRQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYWGARTFRREK 
RAERLTSRVKALFSVLNYERA 

FIG. 27 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 

FIG. 28 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTWGF 
DSKPDEGVQFSSPKCSQSELIANWKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF 
PNYLIS ILESKNWQLLLEI IGSDAMHYLLSKGS IFEALPNDNYLQISGI PLFKNNVFEETVSKKRKR 
T I ETS ITQNKS ARKEVSWNS I S I SRFS I FYRSS YKKFKQDLYFNLHS ICDRNTVHMWLQW IFPRQFG 
LINAFQVKQLHKVIPLVSQSTWPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRS ILVRVFPKLIWGNQRI FEI ILKDLETFLKLSRYES FSLHYLMSNI KISEIEWL 
VLGKRSNAKMCLSDFEKRKQI FAEFIYWLYNSFI IPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 
PFITSMKMEAFEKINENNVRMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKKMLVSTN 
QTLRPVAS ILKHLINEESSGI PFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 
FRIVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKWQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQBCVGI PQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGI INNTFFNESKKRMPFFG 
FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN 
IYRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPS FKYHPCFEQL I YQFQSLTDLI KPLRPVLRQVLFLHRRI AD 



FIG. 29 
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1: FFY VTE TTF QKN RLF FYR KSV WSK 

2 : RQH LKR VQL RDV SEA EVR QHR EA 

3 : ART FRR EKR AER LTS RVK ALF SVL NYE 



1 : AKF LHW LMS VYV VEL LRS FFY VTE TTF Q 

2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 

3: PAL LTS RLR FIP KPD GLR PIV NMD YW 

FIG. 32 
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5'- cag acc aaa gga att cca taa gg -3' 
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4 (B ' ) 



5(C) 

D D Y L L I T 
ctg ctg atg gag gag tag tgg -5' 
a a aaaaa a a 
t t t t 
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FIG. 34 
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Vector Genomic DNA Insert Vector 
A5 [ j j j j I j j ) — 

B2--- ( j j j j j j j i M 

5825 bp Sequenced E^iSM 1 | 

2 kb Hind III Fragment USUI 

F/G. 33/\ 



fez7 + 



RT Motifs 12 3(A) 4{B')5(CL6(C) 



/lotifs 12 3(A) 4(B 

ar .. jIb ei lift 



hi 



56 7 8 9 10 11 12 131415 



Introns 1 2 3 



Hind III Xca I Xca I Hind ill 
Original PCR m 

2-3 & 5-20 cDNA Bffftfeft^^ 

5' RT-PCR W/M2-B14 

EmrtSSa^ 5' RT-PCR W/M2-B1 5 Band A 
5* RT-PCR W/M2-B15 Band B 
5* RT-PCR W/M2-B16 Band C 



500 bp 



F/G. 33B 
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Wild Type Telomerase Gene 
Wild Type Telomerase Gene 



1. Transform with linear fragment 
containing the telomerase gene 
disrupting with a LEU2 or ura4 marker 



M 



Telomerase Gene::M 



Wild Type Telomerase Gene 




Wild Type Telomerase Gene 



Wild Type (X2) 



2. Assay in selective media 

3. Sporulate, and grow on 
selective media 




Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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i 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAG GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
|. A GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

2 

! =«* glu ala glu val arg gin his arg glu ala arg pro ala leu leu 

!.H GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

[,£ 

i; 90 

j|3 ile val asn met asp tyr val val gly ala arg thr phe arg arg 



thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 



ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 



,,J 100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

!;□ GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

ru 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile, pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 

. 190 200 

thr tyr cys val arg arg tyr ala val val gin lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 
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gly thr ser 
GGC ACG TCC 

220 

gin gly ile 
CAG GGG ATC 



leu cys tyr 
CTG TGC TAC 

250 

arg asp gly 
CGG GAC GGG 



thr pro his 
ACA CCT CAC 

280 

arg gly val 
CGA GGT GTC 



val val asn 
GTG GTG AAC 

310 

phe val gin 
TTT GTT CAG 



leu leu asp 
CTG CTG GAT 

340 

tyr ala arg 
TAT GCC CGG 



phe lys ala 
TTC AAG GCT 

370 

arg leu lys 
CGG CTG AAG 



210 

ala arg pro 
GCA AGG CCT 



pro gin gly 
CCG CAG GGC 

240 

gly asp met 
GGC GAC ATG 



leu leu leu 
CTG CTC CTG 

270 

leu thr his 
CTC ACC CAC 



pro glu tyr 
CCT GAG TAT 

300 

phe pro val 
TTC CCT GTA 



met pro ala 
ATG CCG GCC 

330 

thr arg thr 
ACC CGG ACC 



thr ser ile 
ACC TCC ATC 

360 

gly arg • asn 
GGG AGG AAC 



cys his ser 
TGT CAC AGC 



ser arg ala 
TCA AGA GCC 



ser ile leu 
TCC ATC CTC 



glu asn lys 
GAG AAC AAG 



arg leu val 
CGT TTG GTG 



ala lys thr 
GCG AAA ACC 



gly cys val 
GGC TGC GTG 



glu asp glu 
GAA GAC GAG 



his gly leu 
CAC GGC CTA 



leu glu val 
CTG GAG GTG 



arg ala ser 
AGA GCC AGT 



met arg arg 
ATG CGT CGC 



leu phe leu 
CTG TTT CTG 



thr ser tyr 
ACG TCC TAC 

230 

ser thr leu 
TCC ACG CTG 



leu phe ala 
CTG TTT GCG 

260 
asp asD phe 
GAT GAT TTC 



phe leu arg 
TTC CTC AGG 

290 

val asn leu 
GTG AAC TTG 



ala leu gly 
GCC CTG GGT 

320 

phe pro trp 
TTC CCC TGG 



gin ser asp 
CAG AGC GAC 

350 

leu thr phe 
CTC ACC TTC 



lys leu phe 
AAA CTC TTT 

380 

asp leu gin 
GAT TTG CAG 



val gin cys 
GTC CAG TGC 



leu cys ser 
CTC TGC AGC 



gly ile arg 
GGG ATT CGG 



leu leu val 
TTG TTG GTG 



thr leu val 
ACC CTG GTC 



arg lys thr 
CGG AAG ACA 



gly thr ala 
GGC ACG GCT 



cys gly leu 
TGC GGC CTG 



tyr ser ser 
TAC TCC AGC 



asn arg gly 
AAC CGC GGC 



gly val leu 
GGG GTC TTG 



val asn ser 
GTG AAC AGC 



leu gin thr val cys 
CTC CAG ACG GTG TGC 

400 

ala tyr arg phe his 
GCG TAC AGG TTT CAC 



390 

thr asn ile tyr lys 
ACC AAC ATC TAC AAG 



ala cys val leu gin 
GCA TGT GTG CTG CAG 



ile leu leu leu gin 
ATC CTC CTG CTG CAG 

410 

leu pro phe his gin 
CTC CCA TTT CAT CAG 
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gin val trp 
CAA GTT TGG 

430 

arg leu pro 
CGG CTC CCT 



val ala gly 
GTC GCT GGG 

460 

arg ala val 
CGT GCA GTG 



thr pro cys 
ACA CCG TGT 

490 

pro asp ala 
CCA GAC GCA 



420 
lys asn pro 
AAG AAC CCA 



leu leu leu 
CTG CTA CTC 

450 

gly gin gly 
GGC CAA GGG 



ala val pro 
GCT GTG CCA 

480 

his leu arg 
CAC CTA CGT 



ala glu ser 
GCT GAG TCG 
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his phe ser 
CAT TTT TCC 



his pro glu 
CAT CCT GAA 



arg arg arg 
CGC CGC CGG 



pro ser ile 
CCA AGC ATT 



ala thr pro 
GCC ACT CCT 



glu ala pro 
GAA GCT CCC 



cys ala ser 
TGC GCG TCA 

440 
ser gin glu 
AGC CAA GAA 



pro ser ala 
CCC TCT GCC 

470 
pro ala gin 
CCT GCT CAA 



gly val thr 
GGG GTC ACT 

500 

gly asp asp 
GGG GAC GAC 



ser leu thr 
TCT CTG ACA 



arg arg asp 
CGC AGG GAT 



leu arg gly 
CTC CGA GGC 



ala asp ser 
GCT GAC TCG 



gin asp ser 
CAG GAC AGC 



ala asp cys 
GCT GAC TGC 



pro gly gly arg ser 
CCT GGA GGC CGC AGC 

520 

his pro gly leu met 
CAT CCT GGA CTG ATG 



510 

gin pro gly thr ala 
CAA CCC GGC ACT GCC 



ala thr arg pro gin 
GCC ACC CGC CCA CAG 



leu arg leu gin asp 
CTC AGA CTT CAA GAC 

530 

pro gly arg glu gin 
CCA GGC CGA GAG CAG 



540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 

OP : 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 



CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



. LWSLIRCFFYVTEQQKSYSKT . 
. FIIPILQSFFYITESSDLRNRT . 
. LIPKIIQTFFYCTEISSTVTIV. 
. YWELLRS FFYVTETTFQKNRL . 
FFY TE 



Motif 0 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



K 

p hhh K hR h R 

. KSLGFAPGKXRLIPKKT- -TFRPIMTFNKKIV 
. QKTTLPPAVIRLLPKKN- -TFRLITNLRKRFL 
. TLSNFNHSKMRI IPKKSNNEFRI IAI PCRGAD 
. ARPALLTSRL.RFIPKPD- -GLRPIVNMDYWG 
R PK R I 



Motif A 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



AF 

h hDh GY h 
. PKLFFATMDIEKCYDSVNREKLSTFLK. 
. RKKYFVRIDIKSCYDRIKQDLMFRIVK. 
. PEL Y FMKFDVKS CYDS I PRMECMR I LK . 
. PELYFVKVDVTGAYDTIPQDRLTEVIA. 

F D YD 



// 



Motif B 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



Motif C 
Ep pl23 
Sp. Tezl 
Sc Est2 
Hs TCP1 
consensus 

Motif D 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



hPQG pS hh 
. NGKFYKQTKGIPQGLCVSSILSSFYYA . 
. GNSQYLQKVGI PQGS I LSSFLCHFYME . 
. EDKCYIREDGLFQGSSLSAPIVDLVYD . 
. RATSYVQCQGIPQGSILSTLLCSLCYG . 
G QG S 

Y 

h F DD hhh 
. PNVNLLMRLTDDYLLITTQENN . . . 
. KKGSVLLRWDDFLFITVNKKD . . . 
. SQDTLILKLADDFLIISTDQQQ . . . 
. RRDGLLLRLVDDFLLVTPHLTH . . . 
DD L 

Gh h cK 
. NVSRENGFKFNMKKL . . . 
. LNLSLRGFEKHNFST . . . 
. KKLAMGGFQKYNAKA . . . 
. LRTLVRGVPEYGCW . . . 

G 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 

51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 

101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 

151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 

251 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 

301 CCGAGTGCTG CAN ANG C TGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 

3 51 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 

4 01 ACCACCAGCG TGCGCAG CTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
451 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 

i;3 651 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 

! : 3 7 01 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 

751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 

801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 

8 51 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 

'In 901 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 

951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 

1001 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 

10 51 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 

1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NT CTTT CTGG 

J 1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 

l!H 12 01 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 

□ 12 51 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 

|«J 13 01 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 

{ U 13 51 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 

\f. 14 01 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 

SU 14 51 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 

15 01 AACGCCGCTT CCTCAGGAAC AC CAAGAAGT TCATCTCCCT GGGGAAG CAT 
1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 

16 01 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 

16 51 ACCGTCTGCG TG AGGAG AT C CTGGCCAAGT TCCTGCACTG GCTGATGAGT 

17 01 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 
1751 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
1801 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 

18 51 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 
1901 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
1951 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
2 001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
2 0 51 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 
2201 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AG CAT CAT C A 
2 2 51 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
23 01 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 

23 51 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 

24 01 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
24 51 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 



m 

i tjcs 
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2 501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 

2 551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 

2601 AACAAG CTGT TTG CGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 

I'* 2651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 

□ 2701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 

O 21 SI CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGG C AC 

iff 2 8 01 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 

t 'g 2 851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 

2 901 CGGACCTCCA TCAGAGCCAG TCTCACCTTC AAC CGCGGCT TCAAGGCTGG 

2 951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 



a 
m 



\ ru 



} 3 001 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 

! |«* 3 0 51 AT CTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 

„ 3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 

3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 

3 201 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 

3 251 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 

O 3 3 01 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG G AC AG C C C AG 

l,J 3 3 51 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 

j'3 34 01 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 

3 4 51 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 

3 5 01 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 

3 551 AC CGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 

3 601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 

3 651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 

3 701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT C AC CAGGAG C CCGGCTTCCA 

3 751 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 

3 801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 

3 851 GAAGGACCCT GGGAGCT CTG GGAATTTGGA GTG AC CAAAG GTGTGCCCTG 

3 901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGT CAAATT 

3 951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 

4 001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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^GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC 
CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 



AALRPAAHVGS PGPGKPRDA 
QRCVLLRTWEALAPATPAMP- 
SAASCCARGKPWPRP PPRCR- 

GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT 
CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 1 * 



ARS PLPSRALPAAQ PLPRGA 
RAPRCRAVRSLLRSHYREVL 
ALPAAEPCAPCCAATTARCC 



GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 
CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 



AAGHVRAA PGA PGLAAGAAR 
PLATFVRRLGPQGWRLVQRG 
RWPRSCGAWGPRAGGWCSAG 



GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCCANTGCNTGGTGTGCGTGCCCTGGGANGN 
CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCA(^CGCACGGGACCCTNCN 



GPGGFPR?GGP??GVRALG? 
DPAAFRA?VA?C?VCVPW^>'? - 
TRRLSARWWP ?AWCACPG ? "?- 

ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 
241 - — + ~------ + + + + + 3Q0 

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACAGGACGGACTTNCTNGACCACCG 

?AAPRRPLLPPGVLPE??GG 
??PPAAPSFRQVSCL??LVA- 
G ?PPPPPPSARCPA* ? ? W W P - 



CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC 
GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTTNTTGCACGACCGGAAGCCGAAGCG 



PSAA?AVR?RRE ? RAGLRLR 
RVL? ?LC?RGA?NVLAFGFA 
ECC??CA?AAR?TCWPSASR 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 

CGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 

AAGRGPRGPPRGLHHQRAQL- 
LLDGARGGP PEAFTTSVRSY« 
CWTGPAGAPPRPSPPACAAT 

CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 



PAQHGDRRTAGERGVGAAAA 
LPNTVTDALRGSGAWGLLLR 
C P T R * PTHCGGAGRGGCCCA 
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PRGRRRAGS PAGTLR? ? C A G 
RVGDDVLVHLLARCA? F V L V - 
AWATTCWFTCWHAAR ? LCWW- 

GGOTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC 

541 +• + + + + + 600 

CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG 

GSQLRLP?VRAAAVPARRC? 
? PSCAY7VCGPPLYQLGAAT- 
? PAAPT? CAGRRCTS SAL? L - 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCAACGGGCCT 
60! + + + + + + 660 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

SGPAPATR?WTR? RLGSNGP 
QARPPPHA?GPE?VWDPTGL- 
RPGPRHTL?DP?ASGIQRAW- 

GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 
661 + + + + + + 720 

CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

GTIASGRPGSPWAAS PGCEE 
E P * RQGGRGPPGLPAPGARR- 
NHS VREAGVPLGCQ PRVRGG- 

GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 

721 + + + + + + 780 

CGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

ARGQCQPKSAVAQEAQAWRC 
RGGSASRSLPLPKRPRRGAA- 
AGAVPAEVCRCPRG PGVALP- 

CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 
781 + + + + + + 840 

GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

P *AGADARWAGVLG P PGQDA 
PEPERTPVGQGSWAHPGRTP- 
LSRSGRPLGRGPGPTRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 
841 + + + + + + 900 

ACCTGGCTCACTCX5CACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTCGGTG 

W T E * PWFLCGVTCQTRRRSH 
GPS DRGFCVVS PARPAEEAT- 
DRVTVVSVWCHLPDPPKKPP- 

CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 

901 + + + + + + '960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT ' 

LFGGCALWHAPLPPIRGPPA 
S L EGALSGTRHS h p S VGRQH - " , 
LWRVRSLARATPTH PWAAST- 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCCCCG 

961 + - + - + + + * 1020 

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 
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a PRGPPIHIAATTSWDTPCPP 

b HAGP PSTSRPPRPGTRLVPR- 

C TRAPH PHRGHHVLGHALS P G - 

GTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGNACACTGCGNCCCTC 

1021 + + + + + + 1080 

CACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAGTCCGCTGTTCNTGTGACGCNGGGAG 

a VYAETKHFLYSSGDK?TA?L 

b CTPRPSTSSTPQAT7TLRPS- 

C VRRDQ'AL PLLLRRQ ? HC ? P P - 

CTT CCT ACTCAAT AT AT CTG AGGC CCAGCCTGACTGG CGTT CGGG AGGTT CGTGGAGACA 

1081 + + + + + 1140 

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT 

| j, a LPTQYI*GPA*LAFGRFVET 

;« b FLLNISEAQPDWRSGGSWR? 

5 ;f C SYS IYLRPSLTGVREVRGD?- 

13 

! B OTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCC^ 

'pa 1141 + + - + + + + 1200 

! S NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAGGGGCGTCCAACGGGGCGGACGGGGT 

LIS 

|«* a ?FLVPGLGCQDSPQVAPPAP 

i. A b SFWFQALDARI PRRLPRLPQ- 
c LSGSRPWMPGFPAGCPACPS- 

! 3 GCGOTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGC 

m 1201 + + + + + + 1260 

j ;L: CGCNATGACCGTTTACGCCGGGGAGAAAGACCTCGACGAACCCTTGGTGC 

|bJ a A? LANAAPVSGAAWE PRAVP 

b RYWQMRPLFLELLGNHAQCP 
!;~ c ? TGKCGPCFWSCLGTTRSAP- 

ru 

CTACGGGGTGTTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGG 

1261 + + + + + + 1320 

GATGCCCCACAAGGAGTTCTGCGTGACGGGCGACGCTCGACGCCAGTGGGGTCGTCGGCC 

a LRGVPQDALPAASCGH PSSR 

b YGVFLKTHCPLRAAVTPAAG- 

c TGCSSRRTARCELRS PQQPV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGAACACAG 
1321 + + + + + + 1380 

a CLCPGEAPGLCGGPRGGGTQ 

b VCAREKPQGSVAAPEEEEHR- 

c SVPGRS PRALWRPPRRRNTD- 

ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT 

1381 + + + + + + 1440 

TGGGGGCAGCGGACCACGTCGACGAGGCGGTCGTGTCGTCGGGGACCGTCCACATGCCGA 

a TPVAWCSCSASTAAPGRCTA 

b PPS PGAAAPPAQQPLAGVRL- 

c PRRLVQLLRQHS S PWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG 

1441 + + + + + + 1500 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
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SCGPACAGWCPQASGAPGTT 
RAGLPAPAGAPRPLGLQAQR 
VRACLRRLVP PGLWGSRHNE- 

AACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCC 

1501 + + + - --- + + + 1560 

TTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTAGAGGGACCCCTTCGTACGGTTCGAGA 

NAASSGTPRSSSPWGSMPSS 
TPLPQEHQEVHLPGEACQAL 
RRFLRNTKKF I SLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCITGGCTGCGCAGGA 

1561 + + + + + + 1620 

GCGACGTCCTCGACTGCACCTTCTACrCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 

RCRS * RGR*ACGTALGCAGA 
AAGADVEDERAGLRLAAQE P 
LQELTWKMSVRDCAWLRRSP- 

CAGGGGTTGGCTGTGTTC CGGC CGCAGAGCAC CGT CTGCGTG AGGAGATCCTGGCCAAGT 

1621 + + + + + + 1680 

GTCCCCAACCGACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA 

QGLAVFRPQSTVCVRRSWPS 
RGWLCSGRRAPSA* GDPGQV- 
GVGCVPAAEHRLRE E ILAKF- 

TCCTGCACK^CTGATGAGTGTGTACGTOTTCGAGCTGCTC^ 

1681 + + + + + -f- 1740 

AGGACGTGACCGACTACTCACACATGCAGCAGCTCGACGAGTCCAGAAAGAAAATACAGT 

SCTG**VCTSSSCSGLSFMS 
PALADECVRRRAAQVFLLCH- 
LHWLMSVYVVELLRS F F Y V T - 

CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGG 
174 1 + + + + + + 1800 

GCCTCTGGTGCAAAGTTTTCrrGTCCGAGAAAAAGATGGC CTTCTC ACAGACCTCG^ CA 

RRPRFKRTGS FSTGRVSGAS 
GDHVSKEQALFLPEECLEQV- 
ETTFQKNRLFFYRKSVWSKL- 

TGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAG 

1801 + + + + + + i860 

ACGTTTCGTAACCTTAGTCTGTCGTGAACrrTCTCCCACGTCGACGCCCTCGACAGCCTTC 

CKALESDST*RGCSCGSCRK 
AKHWNQTALEEGAAAGAVGS - 
QS IGIRQHLKRVQLRELSEA- 

CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA 

1861 + + + + + + 1920 

GTCTCGAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

QRSGSIGKPGPPC-*RPDSAS 
RGQAASGSQARPADVQT PLH - 
EVRQHREARPALLTSRLRF I- 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAA 

1921 +■ + + + + ---- + 1980 

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT 
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SPSLTGCGRL*TWTTSWEPE 
PQA*RAAADCEKGLRRGSQN- 
PKPDGLRPIVNMDYVVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCG 

1981 + + + + + + 2040 

GCAAGGCGTCTCTTTTCTCCCGGCTCGCAGAGTGGAGCTCCC^CTTCCGTGACAAGTCGC 

RSAEKRGPSVS PRG * RHCSA 
VPQRKEGRASHLEGSGTVQR- 
FRREKRAERLTS RVKALFSV- 

TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGG 

2041 + + + + + + 2100 

ACGAGTTGATGCTCGCCCGCGCCGCGGGGCCGGAGGACCCGCGGAGACACGACCCGGACC 

CSTTSGRGAPASWAPLCWAW 
AQLRAGAAPRP PGRLCAGPG- 
LNYERARRPGLLGAS VLGLD- 

ACGATATCC^^GGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

2101 + + + + -f- + 2160 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 

TISTGPGAPSCCVCGPRTRR 
RYPQGLAHLRAACAGPGPAA- 
DIHRAWRTFVLRVRAQDPPP- 

CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCC^ 

2161 + + + + + + 2220 

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT 

LSCTLSRWM* RARTTPSPRT 
* AVLCQGGCDGRVRHH'P PGQ - 
ELYFVKVDVTGAYDT I PQDR- 

GGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGT 

2221 + + + + + + 2280 

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTTTGGGGTCTTGTGCATGACGCACGCAGCCA 

GSRRSS PASSNPRTRTACVG 
AHGGHRQHHQTPEHVLRASV- 
LTEVIAS I I KPQNTYCVRRY- 

ATGCCGTGGTCCAGAAGGCCGCCC^TGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCT 

2281 - + + + + + + 2340 

TACGGCACCAGGTCTTCCGGCGGGTACCCGTGCAGGCGTTCCGGAAGTTCTCGGTGCAGA 

MPWSRRPPMGTSARPS RATS 
CRGPEGRPWARPQGLQEPRL 
AVVQKAAHGHVRKAFKSHVS- 

CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGANAACA 
2341 + + + + + ^ + 2400 

gatggaactgtctggaggtcggcatgtacgctgtcaagcaccgagtggacgtcctnttgt 

lp*qtssrtcdsswltcr?'t 
yldrppavhatvrgs p a g ? q - 
tltdlqpymrqfvahlq7ns- 

gcccgctgagggatgccgtcgtcatcgagcagagctcctccctgaatgaggccagcagtg 

2401 + + + + + + 2460 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
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AR*GMPSSSSRAPP*MRPAV 
PAEGCRRHRAELLPE + GQQW- 
PLRDAVVIEQSSSLNEASSG- 

GCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGT 

2461 + + + + + + 2520 

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTCA 

ASSTSSYASCATTPCASGAS 
PLRRLPTLHVPPRRAHQGQV- 
LFDVFLRFMCHHAVRIRGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCT 

2521 + + + + + + 2580 

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACGTCGG 

PTSSARGSRRAPSSPRCSAA 
LRPVPGDPAGLHPLHAALQP- 
YVQCQG I P Q G S I L S T L L CS L - 

TGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCC 

2581 -- + + + + + + 2640 

ACACGATGCCGCTGTACCTCTTGTTCGACAAACGCCCCTAAGCCGCCCTGCCCGACGAGG 

CATATWRTSCLRGFGGTGCS 
VLRRHGEQAVCGDSAGRAAP 
CYGDMENKLFAGI RRDGLLL- 

TGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACC 

2641 + + + - + + + 2700 

ACGCAAACCACCTACTAAAGAACAACCACTGTGGAGTGGAGTGGGTGCGCTTTTGG^ 

CVWWMISCW*HLTSPTRKPS 
AFGG* FLVGDTS PHPRENLP 
RLVDDFLLVTPHLTHAKTFL- 

TCAGGAC C CTGGTCCGAGGTGT C CCTGAGT ATGGCTGCGTGGTGAACTTGCGGAAGACAG 

2701 - + + --- + + + + 2760 

AGTCCTGGGACCAGGCTCCACAGGGACTCATACCGACGCACCACTTGAACGCCTTCTGTC 

SGPWSEVSLSMAAW*TCGRQ 
QDPGPRCP*VWLRGELAEDS 
RTLVRGVPEYGCVVNLRKTV- 

TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGG 

2761 + + -- + + + + 2820 

ACCACTTGAAGGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAAGTCTACGGCC 

W*TSL* KTRPWVARLLFRCR 
GELPCRRRGPGWHGFCSDAG- 
VNFPVEDEALGGTAFVQ MPA- 

CCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGA 

2821 + + + + + * + 2880 

GGGTGCCGGATAAGGGGACCACGCCGGACGACGACCTATGGGCCTGGGACCTCCACGTCT 

PTAYSPGAACCWIPGPWRCR 
PRPI PLVRPAAGYPD PGGAE 
HGLFPWCGLLLDTRT LEVQS- 

GCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCT 

2881 + + + + + T 2940 

CGCTGATGAGGTCGATACGGGCCTGGAGGTAGTCTCGGTCAGAGTGGAAGTTGGCGCCGA 
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ATTPAMPGPPSEPVSPSTAA 
RLLQLCPDLHQSQSHLQ PRL- 
DYSSYARTS I RAS LTFNRGF- 

TCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTC 

2941 + + +. + + +. 3000 

AGTrCCGACCCTCCTTGTACGCAGCGTTTGAGAAACCCCAGAACGCCGACTTCACAGTGT 

SRLGGTCVANSLGS CG* SVT 
QGWEEHASQTLWGLAAEVSQ- 
KAGRNMRRKLFGVLRLKCHS- 

GCCTGTTTCTGGATTTGCAGGTGAACAGCCrCCAGACGGTGTGCACC^ 

3001 + - + + + + + 3060 

CGGACAAAGACCT AAACGTC CACTTGTCGG AGGTCTGC CACACGTGGTTGT AGATGTTCT 

ACFW ICR*TASRRCAPTSTR 
PVSGFAGEQPPDGVHQHLQD 
LFLDLQVNSLQTVCTNI Y K I - 

TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTC 

3061 + -- + + + + + 3120 

AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 

SSCCRRTGFTHVCCSSHFIS 
PPAAGVQVS RMCAAAP I S S A 
LLLQAYRFHACVLQL PFHQQ- 

AAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCT 

3121 + + + + + + 318O 

TTGAAACCTTCTTGGGGTGTAAAAAGGACGCGCAGTAGAGACTGTGCCGGAGGGAGAC^ 

KFGRTPHFSCASSLTRPPSA 
SLEEPHI FPARHL*HGLPLL 
VWKNPTFFLRVI S DTAS L C Y - 

ACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCC 

3181 + + + + + + 3240 

TGAGGTAGGACTTTCGGTTCTTGCGTCCCTACAGCGACCCCCGGTTCCCGCGGCGGCCGG 

TPS * KPRTQGCRWGPRAPPA 
LHPESQERRDVAGGQGRRRP- 
S I L KAKNAGMS LGAKGAAG P - 

CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTC 

3241 + + + + + + 3300 

GAGACGGGAGGCTCCGGCACGT CACCGACACGGTGGTT CGT AAGG ACGAGTT CGACTGAG 

LCPPRPCSGCATKHSCSS*L 
SALRGRAVAVP PS I PAQADS - 
LPS EAVQWLCHQA FLLKLTR- 

GACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGA 

3301 + + + + + + 3360 

CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 

DTVS PTCHSWGHSGQPRRS * 
TPCHLRATPGVTQDSPDAAE - 
HRVTYVPLLGS LRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC 

3361 + -- + + + + + 3420 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG 
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VGSSRGRR* LPWRPQPTRHC 
SEAPGDDADCPGGRSQPGTA- 
RKLPGTTLTALEAAANPALP- 

C CT CAGACTT CAAGAC CATC CTGGACTGATGGC CAC C CGC C CACAGC CAGGC CGAGAGCA 

3421 + + + + - + + 3480 

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

PQTSRPSWTDGHPPTARPRA 
LRLQDHPGLMATRPQ PGRE'Q- 
SDFKTILD*WPPAHSQAESR- 

GACACGAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACAC 

3481 + + + + + + 3540 

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG 

DTSS PVTPGSTSQGGRGGPH 
TPAALSRRALRPREGGAAHT- 
HQQPCHAGLYVPGREGRPTP- 

3541 + + + + -- + + 3600 

GGTCCGGGCGTGGCGACCCTCAGACTCCGGACTCACTCAGAAACCGGCT 

PGPHRWESEA*VSVWPRPAC 
QARTAGSLRPE*VFGRGLHV- 
RPAPLGV*GLSECLAEACMS- 

CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGC 

3601 + + + + + + 3660 

GGCCGACrTCCGACTCACAGGCCGACTCOSGACTCGCTCAC^ 

PAEG*VSG*GLSECPAKG*V 
RLKAECPAEA*ASVQPRAEC- 
G*RLSVRLRPERVSSQGLSV- 

TCCAGCACACCTGCCGTCTTCACTTCCCCA(^GGCTGGCGCTCGGCTCCACCCCAGGGCC 

3661 + + + + + + 3720 

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

SSTPAVFTS PQAGARLHPRA 
PAHLPSSLPHRLALGSTPGP 
QHTCRLHFPTGWRSA P PQGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCC^ 

3721 + + + + + + 3780 

TCGAAAAGGAGTGGTCCTCGGGCCGAAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT 

SFSSPGARLPLPT*E*SIPR 
AFPHQEPGFHS PHRNSPSPD 
LFLTRS PASTPH IG I VHPQI- 

TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCAC 

3781 + + + + + + 3840 

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC 

FAIVHPSPCPPLPSTPTIQV 
SPLFTPRPALLCLPPPPSRW- 
RHCS PLALPS FAFH PHHPGG- 

GAG AC CCTGAGAAGGAC C CTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGC CCTG 

3841 + + +. + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACCTCACTGGTTTCCACACGGGAC 
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a ETLRRTLGALGIWSDQRCAL 

b RP*EGPWELWEFGVTKGVPC- 

c DPEKDPGSSGNLE*PKVCPV- 

TACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAAT^ 

3901 + + + + + + 3960 

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA 



\0, a YTGEDPAPGWGSLWVKLGGG 

i . b TQARTLHLDGGPCGSNWGEV- 

'■' C HRRGPCTWMGVPVGQ IGGRC- 



I ; g GCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAA 

3961 + + + + + + 4020 

CG ACAC C CTCATTTT ATGACTT ATATACTCAAAAAGT CAAAACTTTTTTTTTTTTTTTTT 



a AVGVKY* IYEFFSFEKKKKK 

b LWE*NTEYMSFSVLKKK KKK- 

c CGSKILNI*VFQF*KKKKK K 

AAAAAAAAA 

4021 4029 

TTTTTTTTT 

a K K K 

b K K 

c K K 
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i 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 . 90 

leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp -gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

17 0 18 0 

ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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leu 
CTG 


gly 
GGA 


cys 
TGC 


glu 
GAA 


arg 
CGG 


val 
GTC 


pro 
CCC 


leu 
CTG 


gly 
GGC 


leu 
CTG 


ser 
AGT 


ala 
GCC 


ser 
AGC 


230 
arg 
CGA 


ser 
AGT 


ala 
GCT 


ala 

GCC 


pro 
CCT 


glu 
GAG 


pro 
CCG 


ala 

GCC 


his 
CAC 


pro 
CCG 


260 
gly 
GGC 


arg 
AGG 


val 
GTG 

gly 
GGT 


val 
GTG 

ala 
GCG 


ser 
TCA 

leu 
CTC 


pro 
CCT 
2 90 
ser 
TCT 


ala 
GCC 

gly 
GGC 


gin 
CAG 


his 
CAC 


his 
CAC 


ala 
GCG 


gly 
GGC 


trp 
TGG 


asp 
GAC 


thr 
ACG 


3 20 
pro 
CCT 


cys 
TGT 


leu 
CTC 


tyr 
TAC 


ser 
TCC 


ser 
TCA 


gly 
GGC 


leu 
CTC 


ser 
AGC 


ser 
TCT 


350 
leu 
CTG 


arg 
AGG 


glu 
GAG 


thr 
ACC 


ile 
ATC 


phe 
TTT 


leu 
CTG 


arg 
CGC 


arg 
AGG 


leu 
TTG 


380 
pro 
CCC 


arg 
CGC 


leu 
CTG 


phe 
TTT 


leu 
CTG 


glu 
GAG 


leu 
CTG 


val 
GTG 


leu 
CTC 


leu 
CTC 


410 
lys 
AAG 


thr 
ACG 



ala trp asn his ser 
GCC TGG AAC CAT AGC 

220 

pro ala pro gly ala 
CCA GCC CCG GGT GCG 



leu pro leu pro lys 
CTG CCG TTG CCC AAG 

250 

glu arg thr pro val 
GAG CGG ACG CCC GTT 



thr arg gly pro ser 
ACG CGT GGA CCG AGT 

280 

arg pro ala glu glu 
AGA CCC GCC GAA GAA 

thr arg his ser his 
ACG CGC CAC TCC CAC 

310 

pro pro ser thr ser 
CCC CCA TCC ACA TCG 



pro pro val tyr ala 
CCC CCG GTG TAC GCC 

340 

asp lys glu gin leu 
GAC AAG GAG CAG CTG 



pro ser leu thr gly 
CCC AGC CTG ACT GGC 

370 

gly ser arg pro trp 
GGT TCC AGG CCC TGG 



leu pro gin arg tyr 
CTG CCC CAG CGC TAC 

400 

leu gly asn his ala 
CTT GGG AAC CAC GCG 



his cys pro leu arg 
CAC TGC CCG CTG CGA 
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210 

val arg glu ala gly 
GTC AGG GAG GCC GGG 



arg arg arg gly gly 
AGG AGG CGC GGG GGC 

240 

arg oro arg arg gly 
AGG CCC AGG CGT GGC 



gly gin gly ser trp 
GGG CAG GGG TCC TGG 

270 

asp arg gly phe cys 
GAC CGT GGT TTC TGT 



ala chr ser leu glu 
GCC ACC TCT TTG GAG 

300 • 

pro ser val gly arg 
CCA TCC GTG GGC CGC 



arg pro pro arg pro 
CGG CCA CCA CGT CCC 

330 

glu thr lys his phe 
GAG ACC AAG CAC TTC 



arg pro ser phe leu 
CGG CCC TCC TTC CTA 

360 

ala arg arg leu val 
GCT CGG AGG CTC GTG 



met pro gly thr pro 
ATG CCA GGG ACT CCC 

390 

trp gin met arg pro 
TGG CAA ATG CGG CCC 



gin cys pro tyr gly 
CAG TGC CCC TAC GGG 

420 

ala ala val thr pro 
GCT GCG GTC ACC CCA 
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ala ala gly 
GCA GCC GGT 



ala pro glu 
GCC CCC GAG 



leu arg gin 
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



met ser val 
ATG AGC GTG 



gly cys val 
GGC TGT GTT 



ala lys phe 
GCC AAG TTC 



leu arg ser 
CTC AGG TCT 



arg leu phe 
AGG CTC TTT 



ile gly ile 
ATT GGA ATC 

ser glu ala 
TCG GAA GCA 



leu thr ser 
CTG ACG TCC 



pro ile val 
CCG ATT GTG 



val cys ala 
GTC TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 



leu his trp 
CTG CAC TGG 

560 

phe phe tyr 
TTC TTT TAT 



phe tyr arg 
TTC TAC CGG 

590 

arg gin his 
AGA CAG CAC 

glu val arg 
GAG GTC AGG 

620 

arg leu arg 
AGA CTC CGC 



asn met asp 
AAC ATG GAC 



430 
arg glu lys 
CGG GAG AAG 



thr asp pro 
ACA GAC CCC 

460 

pro trp gin 
CCC TGG CAG 



val pro pro 
GTG CCC CCA 

490 

leu arg asn 
CTC AGG AAC 



leu ser leu 
CTC TCG CTG 

520 

ala trp leu 
GCT TGG CTG 



glu his arg 
GAG CAC CGT 

550 

leu met ser 
CTG ATG AGT 



val thr glu 
GTC ACG GAG 

580 

lys ser val 
AAG AGT GTC 



leu lys arg 
TTG AAG AGG 
610 

gin his arg 
CAG CAT CGG 



phe ile pro 
TTC ATC CCC 

640 

tyr val val 
TAC GTC GTG 



pro gin gly 
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



val tyr val 
GTG TAC GTC 



thr thr phe 
ACC ACG TTT 



trp ser lys 
TGG AGC AAG 



val gin leu 
GTG CAG CTG 

glu ala arg 
GAA GCC AGG 



lys pro asp 
AAG CCT GAC 



gly ala arg 
GGA GCC AGA 



ser val ala 
TCT GTG GCG 

450 

val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 

gly ser arg 
GGC TCC AGG 



phe ile ser 
TTC ATC TCC 

510 

thr trp lys 
ACG TGG AAG 



pro gly val 
CCA GGG GTT 

540 

glu ile leu 
GAG ATC CTG 



val glu leu 
GTC GAG CTG 

570 

gin lys asn 
CAA AAG AAC 



leu gin ser 
TTG CAA AGC 

600 

arg glu leu 
CGG GAG CTG 

pro ala leu 
CCC GCC CTG 

630 

gly leu arg 
GGG CTG CGG 



thr phe arg 
ACG TTC CGC 
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650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 

770 780 
thr asp leu gin pro tyr met arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

800 810 
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 
gin cys gin gly ile pro gin gly ser ile leu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870* 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 
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880 

leu val thr pro his leu thr his ala lys thr phe leu arg thr 

TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 

leu val arg gly val pro glu tyr gly cys val val asn leu arg 

CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 

910 



lys 


thr 


val 


val 


asn 


phe 


pro 


val 


glu 


asp 


glu 


ala 


leu 


gly 


gly 


AAG 


ACA 


GTG 


GTG 


AAC 


TTC 


CCT 


GTA 


GAA 


GAC 


GAG 


GCC 


CTG 


GGT 


GGC 








920 




















930 




thr 


ala 


phe 


val 


gin 


met 


pro 


ala 


his 


gly 


leu 


phe 


pro 


trp 


cys 


ACG 


GCT 


TTT 


GTT 


CAG 


ATG 


CCG 


GCC 


CAC 
940 


GGC 


CTA 


TTC 


CCC 


TGG 


TGC 


gly 


leu 


leu 


leu 


asp 


thr 


arg 


thr 


leu 


glu 


val 


gin 


ser 


asp 


tyr 


GGC 


CTG 


CTG 


CTG 


GAT 


ACC 


CGG 


ACC 


CTG 


GAG 


GTG 


CAG 


AGC 


GAC 


TAC 








950 




















960 




ser 


ser 


tyr 


ala 


arg 


thr 


ser 


ile 


arg 


ala 


ser 


leu 


thr 


phe 


asn 


TCC 


AGC 


TAT 


GCC 


CGG 


ACC 


TCC 


ATC 


AGA 


GCC 


AGT 


CTC 


ACC 


TTC 


AAC 



970 

arg gly phe lys ala gly arg asn met arg arg lys leu phe gly 
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 990 
val leu arg leu lys cys his ser leu phe leu asp leu gin val 
GTC TTG CGG CTG AAG TGT CAC AGC CTG „ TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gin thr val cys thr asn ile tyr lys ile leu leu 
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 1020 
leu gin ala tyr arg phe his ala cys val leu gin leu pro phe 
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT 

1030 

his gin gin val trp lys asn pro thr phe phe leu arg val ile 
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC 

1040 1050 
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn 
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

ala gly met ser leu gly ala lys gly ala ala gly pro leu pro 
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC 

1070 1080 
ser glu ala val gin trp leu cys his gin ala phe leu leu lys 
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG 

1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
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1100 1110 
arg thr ala gin thr gin leu ser arg lys leu Dro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGG C C AC C CG C C C AC AG C C AGG C C G AG AGCAG A 
CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 
; % AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

□ GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

□ 

|jj CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

% CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

n 

M CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 
GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 
!2 CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

1 3 TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

i.J 

AAAAAAAAAA 

in 
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